A Mutual Selection Model for Weighted Networks 
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For most networks, the connection between two nodes is the result of their mutual affinity and 
attachment. In this paper, we propose a mutual selection model to characterize the weighted 
networks. By introducing a general mechanism of mutual selection, the model can produce power- 
law distributions of degree, weight and strength, as confirmed in many real networks. Moreover, we 
also obtained the nontrivial clustering coefficient C, degree assortativity coefficient r and degree- 
strength correlation, depending on a model parameter m. These results are supported by present 
empirical evidences. Studying the degree-dependent average clustering coefficient C{k) and the 
degree-dependent average nearest neighbors' degree knn{k) also provide us with a better description 
of the hierarchies and organizational architecture of weighted networks. 

PACS numbers: 02.50.Le, 05.65.+b, 87.23. Ge, 87.23. Kg 



I. INTRODUCTION 



In the past few years, there has been a great devotion 
of physicists to understand and characterize the under- 
lyingrnechanisms of complex networks, e.g. the Inter- 
net the WWWQ, the scientific collaboration net- 
works (SCN) 13, 0, U and world-wide airport networks 
(wAN)[i, H Ig. Until now, network researchers have 
mainly focused on the topological aspect of graphs, that 
is, unweighted networks. Typically, Barabasi and Albert 
proposed a famous model (BA model) that introduces 
the linear degree preferential attachment mechanism to 
study unweighted growing networks 0. In their model, 
however, if one considers other measures like the cluster- 
ing coefficient then one may conclude that this model is 
still insufficient to describe reality. The hypothesis of a 
linear attachment rate is empirically supported by mea- 
suring different real networks, but the origin of the ubiq- 
uity of the linear preferential attachment is not clear yet. 
Most recently, the availability of more complete empiri- 
cal data has allowed scientists to consider the variation 
of the weights of links that reflect the physical character- 
istics of many real networks. It is well-known that net- 
works are not only specified by their topology but also 
by the dynamics of weight taking place along the links. 
For instance, the heterogeneity in the intensity of connec- 
tions may be very important in understanding network 
systems. The amount of traffic characterizing the con- 
nections of communication systems or large transport in- 
frastructure is fundamental for a full description of these 
networks. Take the WAN for example: each given edge 
weight Wij (traffic) is the number of available seats on 
direct flight connections between the airports i and j. In 
the SCN, the nodes are identified with authors and the 
weight depends on the number of coauthored papers. Ob- 
viously, there is a tendency for a modelling approach to 



networks that goes beyond the purely topological point 
of view. In the light of this need, Alain Barrat, et al. pre- 
sented a model (BBV model) that integrates the topol- 
ogy and weight dynamical evolution to study the growth 
of weighted networks 0|. Their model yields scale-free 
properties of the degree, weight and strength distribu- 
tions, controlled by an introduced parameter 5. How- 
ever, its weight dynamical evolution is triggered only by 
newly added vertices, hardly resulting in satisfying inter- 
pretations to the collaboration networks or the airport 
systems. 

The properties of a graph can be expressed via its ad- 
jacency matrix a^j, whose elements take the value 1 if an 
edge connects the vertex i to the vertex j and otherwise. 
The data contained in the previous data sets permit to 
go beyond this topological representation by defining a 
weighted graph. Weighted networks are often described 
by a weighted adjacency matrix Wij which represents the 
weight on the edge connecting vertices i and j, with 
«, j = 1, . . . , TV, where N is the size of the network. We 
will only consider undirected graphs, where the weights 



are symmetric {wi 



). As confirmed by measure- 



ments, complex networks often exhibit a scale- free degree 
distribution P{k) ~ fc"'^ with 2< 7 <3 HQ. The weight 
distribution P(w) that any given edge has weight w is an- 
other significant characterization of weighted networks, 
and it is found to be heavy tailed, spanning several orders 
of magnitude ■ A natural generalization of connectiv- 
ity in the case of weighted networks is the vertex strength 
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described as Si — X]jGr(i) ' where the sum runs over 
the set r(z) of neighbors of node i. The strength of a 
vertex integrates the information about its connectivity 
and the weights of its links. For instance, the strength 
in WAN provides the actual traffic going through a ver- 
tex and is obvious measure of the size and importance of 
each airport. For the SCN, the strength is a measure of 
scientific productivity since it is equal to the total num- 
ber of publications of any given scientist. This quantity 
is a natural measure of the importance or centrality of a 
vertex in the network. Empirical evidence indicates that 
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in most cases the strength distribution has a fat tail '7|, 
similar to the power law of degree distribution. Highly 
correlated with the degree, the str eng th usually displays 
scale-free property s ~ fc^ 

The previous models of complex networks always incor- 
porate the (degree or strength) preferential attachment 
mechanism, which may result in scale-free properties. Es- 
sentially speaking, this mechanism just describes inter- 
actions between the newly- added node and the old ones. 
The fact is that such interactions also exist between old 
nodes. This perspective has been practised in the work of 
Dorogovtsev and Mendes (DM) who proposed a class 
of undirected and unweighted models where new edges 
are added between old sites (internal edges) and exist- 
ing edges can be removed (edge removal). On the other 
hand, we argue that any connection is a result of mu- 
tual affinity and attachment between nodes, while many 
network models seem to ignore this point. Traditional 
models often present us such an evolution picture: pre- 
existing nodes are passively attached by newly adding 
nodes according to linear degree (or strength) preferen- 
tial mechanism. This picture is just a partial aspect for 
most complex networks. It is worth remarking that the 
creation and reinforcement of internal connections are an 
important aspect for understanding real graphs. 

In this paper, we shall present a model for weighted 
networks that considers the topological evolution under 
the general mechanism of mutual selection and attach- 
ment between vertices. It can mimic the reinforcement 
of internal connections and the evolution of many infras- 
tructure networks. The diversity of scale-free character- 
istics, nontrivial clustering coefficient, assortativity co- 
efficient and nonlinear strength-degree correlation that 
have been empirically observed can be well explained 
by our microscopic mechanisms. Moreover, in contrast 
with previous models where weights are assigned stati- 
cally 0, 01 or rearranged locally , we allow weights 
to be widely updated. 



II. THE MUTUAL SELECTION MODEL 

The model starts from an initial configuration of A^o 
vertices fully connected by links with assigned weight wq. 
The model is defined on two coupled mechanisms: the 
topological growth and the mutual selection dynamics: 

(i) Topological Growth. At each time step, a new ver- 
tex is added with n edges connected to n previously ex- 
isting vertices, choosing preferentially nodes with large 
strength; i.e. a node i is chosen according to the strength 
preferential probability: 

n ■ - (I) 

The weight of each new edge is also fixed to wq. 

ill) Mutual Selection Dynamics. Every existing node i 
selects m other pre-existing nodes for possible connection 



according to the probability: 

n,^, = ^ . (2) 

Considering the normalization requirement and that ver- 
tices are not permitted to connect themselves, the de- 
nominator of Ili^j contains the term — s^. If two un- 
connected nodes are mutually selected, then an internal 
connection is built between them. If two connected nodes 
are mutually selected, then their connection is strength- 
ened; i.e. their edge weight is increased by wq. Here 
the parameter m is the number of candidate vertexes for 
creating or strengthening connections. Later, we will see 
that m also controls the growing speed of the network's 
total strength; for instance, the increasing rate of total 
information in a communication system. 

We argue that connections in most real networks are 
due to the mutual selections and attachments between 
nodes. Take the SCN for example: the collabora- 
tions among scientists require their common interest and 
mutual acknowledgements. Unilateral effort does not 
promise collaboration. Two scientists that both have 
strong scientific potentials (large strengths) and long col- 
laborating history are more likely to publish papers to- 
gether during a certain period. Likewise, for the Movie 
Actor Collaboration Networks (MACN) , two actors that 
both have high popularity, if they co-star, are more prob- 
ably to boost up the box office. So it is reasonable to as- 
sume that each node is more likely to choose those nodes 
with large strengths when building or strengthening con- 
nections. But pre-existing nodes with large strengths will 
not be passively attached by nodes with small strength. 
There are competition and adaptation in such a complex 
systems. Both natural and social networks bear such 
a property or mechanism during their evolutions. The 
above description of our model could satisfactorily ex- 
plain the WAN too. The weight here denotes the relative 
magnitude of the traffic on a flight connection. At the be- 
ginning of the construction of the airport network, the air 
traffic is usually open between metropolises that hold a 
high status in both economy and politics. Once a new air- 
line is created between two airports, it will trigger a more 
intense traffic activity depending on the specific nature of 
the network topology and on the micro dynamics. With 
the improvement of national economy and the expansion 
of population, the air traffic between metropolises will 
increase. Due to their importance, there is an obvious 
need for other cities to build new airports to connect the 
metropolises. Yet, it is reasonable that the traffic be- 
tween metropolises will grow faster than that between 
other cities, each of which holds a lower economical and 
political status and a smaller population that can afford 
airplanes. But due to the limit of energy and resources, 
each node can only afford a limited number of connec- 
tions. Hence facing the vertex pool, they have to choose. 
For instance, in the WAN, an airport can not afford the 
cost of connecting all the other airports. 

The network provides the substrate on which numerous 
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dynamical processes occur. Technology networks provide 
large empirical database that simultaneously captures 
the topology and the dynamics taking place on it. For In- 
ternet, the information flow between routers (nodes) can 
be represented by the corresponding edge weight. The 
total information load that each router deals with can 
be denoted by the node strength, which also represents 
the importance of given router. The increasing informa- 
tion flow as an internal demand always spurs the expan- 
sion of technological networks. Specifically, the largest 
contribution to the growth is given by the emergence of 
links between already existing nodes. This clearly points 
out that the Internet growth is strongly driven by the 
need of a redundancy wiring and an increasing need of 
available bandwidth for data transmission "iS]. On one 
end, newly-built links (between existing routers) are sup- 
posed to preferentially connect high strength routers, be- 
cause otherwise, it would lead to the unnecessary traffic 
congestion along indirect paths that connect those high 
strength nodes. Naturally, information traffic along ex- 
isting links between high strength routers, in general, in- 
creases faster than that between low strength routers. 
This phenomenon could be reproduced in our model too. 
On the other end, new routers preferentially connect to 
routers with larger bandwidth and traffic handling ca- 
pabilities (the strength driven attachment). This char- 
acteristic also exists in airport system, power grid, and 
railroad network; and they could be explained by our 
mechanisms. 



III. PROBABILITY DISTRIBUTIONS AND 
STRENGTH EVOLUTION 



The network growth starts from an initial seed of A^o 
nodes, and continues with the addition of one node per 
unit time, until a size N is reached. Hence the model 
time is measured with respect to the number of nodes 
added to the graph, i.e. t — N ~ Nq, and the natural 
time scale of the model dynamics is the network size N. 
Using the continuous approximation, we can treat k, w, 
s and the time t as continuous variables P, Q . Then the 
edge weight Wij is updated as this evolution equation: 



equation of strength i can be written as below: 



'dt 



E '-'""»j 
dt 



+ n X 
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(m^ -|- n) 



Si 



(4) 



This equation may be written in a more compact form 
by noticing that 



^^dt + ntfii{m^ + 2n)t, (5) 



feeA 



where A represents the set of existing nodes at time step 
t. By plugging this result into the equation (4), we obtain 
the following strength dynamical equation 



dsj 
'dt 



2n t 



(6) 



which can be readily integrated with initial conditions 
Si{t — i) — n, yielding 



s^(^) 



(7) 



The equation J^i ~ (™^ + 2n)t also indicates that the 
total strength of the vertices in statistical sense is uni- 
formly increased with the size of network. As one see, 
the growing speed of the network's total strength load is 
mainly determined by the model parameter m. 

The knowledge of the time-evolution of the various 
quantities allows us to compute their statistical proper- 
ties. Indeed, the time ti = t at which the node i enters 
the network is uniformly distributed in [0,t] and the de- 
gree probability distribution can be written as 



P(s,<) 



1 



t + No 



6{s - Si{t))dti 



(8) 







where d{x) is the Dirac delta function. Using equation 
Si{t) ~ {t/iY) obtained from Eq. (7), one obtains in the 
infinite size limit t oo the distribution P(s) ~ s" with 
a=l + 1/6: 



dwij 
~dt 



Sj Si 

— X m— 

S^ Lfc Sfe - 

m^SiSj 

{T,k^k - Sj){J^,^Sk~ Si)' 



(3) 



There are two processes that contribute to the increment 
of strength s^. One is the creation or reinforcement of 
internal connections incident with node i, the other is 
the attachment to i by newly added node. So the rate 



a = 2 + n/{m^ + n). 



(9) 



Obviously, when m = the model is topologically equiv- 
alent to the BA network and the value a = 3 is recov- 
ered. For larger values of to, the distribution is gradually 
broader with a — > 2 when to — > cx). 

We performed numerical simulations of networks gen- 
erated by choosing different values of to and fixing n — 5 
and uiq = 1. Considering that every vertex strength can 
at most increase by m from internal connections and a 
newly added node can at most connect with n existing 



4 




FIG. 1: Probability distribution P{s). Data are consistent 
with a power-law behavior s^". In the inset we give the value 
of a obtained by data fitting (filled circles), together with the 



analytical expression a = 2 + n/{m + 



2 + + 



5) (line). The data are averaged over 10 independent runs of 
network size N=5000. 




FIG. 2: Evolution of strength of vertices during the growth 
of network for various of m. In the inset we give the value 
of 6 obtained by data fitting (filled circles) , together with the 

2 I 

analytical expression d — *" 



^^Vri7T(line) 



nodes, it is easy to see that the initial network configura- 
tion must satisfy A'o = max(TO-|- l,n). So for example, if 
m = 10, then A^o = H- We have checked that the scale- 
free properties of our model networks are independent of 
the initial conditions. Numerical simulations are consis- 
tent with our theoretical predictions, verifying again the 
reliability of our present results. Fig. 1 gives the proba- 
bility distribution P{s) s", which is in excellent agree- 
ment with the theoretical predictions. In Fig. 2 we show 
the behavior of the vertices' strength versus time for dif- 
ferent values of m, recovering the behavior predicted by 
analytical methods. We also report the average strength 



FIG. 3: Strength Si versus ki for different m (log- log scale). 
Linear data fitting gives slope 1.04, 1.16, 1.26 and 1.41 (from 
bottom to top), demonstrating the correlation of s ~ fc''. 




FIG. 4: Probability distribution of the degrees P{k) ~ k ' 
for different m. The data are averaged over 10 independent 
runs of network size N=5000. 



Si of vertices with degree ki, which displays a nontriv- 
ial power-law behavior s ~ A:'' as confirmed by empirical 
measurement. Unlike BBV networks (where f3 — 1), the 
exponent P here varies with the parameter to in a non- 
trivial way as shown in Fig. 3. The nontrivial s 
correlation demonstrates the significant part of weight 
increment along existing edges. More importantly, one 
could check the scale-free property of degree distribution 
(P(fc) ~ k~^) by combining s ^ k^ with P{s) ~ s~". 
Considering the conservation of probability 



P{k)dk 



P{s)ds, 



we can readily calculate the exponent 7 
ds 



P{k) = P{s) 



dk 



s-"l3k 



0-1 



fik' 



(/3(a-l) + l) 



(10) 



(11) 
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restricted to classes of vertices with degree k: 




FIG. 5: Probability distribution of the weights P{w) 

for various m. The data are averaged over 10 independent 

runs of network size N=5000. 



giving 7 = l3{a — 1) + 1. The scale-free properties of de- 
gree and weight obtained from simulations are presented 
in Fig. 4 and Fig. 5, respectively. The simulation consis- 
tence of scale- free properties indicates that our model can 
indeed produce power-law distributions of degree, weight 
and strength. In this case, the numerical simulations of 
the model reproduce the behaviors predicted by the an- 
alytical calculations. 



IV. CLUSTERING AND CORRELATION 

Many real networks in nature and society share two 
generic properties: they are scale-free and they display a 
high degree of clustering. Along with the general vertices 
hierarchy imposed by the scale- free strength distribution, 
complex networks show an architecture imposed by the 
structural and administrative organization of these sys- 
tems, which is mathematically encoded in the various 
correlations existing among the properties of different 
vertices. For this reason, a set of topological and weighed 
quantities are customarily studied in order to uncover the 
network architecture. A first and widely used quantity 
is given by the clustering of vertices. The clustering of a 
vertex i is defined as 



ki {k, 



1 ^ 

' ^ ) ■ I. 



(12) 



and measures the local cohesiveness of the network in 
the neighborhood of the vertex. Indeed, it yields the 
fraction of inter-connected neighbors of a given vertex. 
The average over all vertices gives the network clustering 
coefficient which describes the statistics of the density 
of connected triples. Further information can be gath- 
ered by inspecting the average clustering coefficient C{j) 



^ ' i,ki=k 



(13) 



In many networks, the average clustering coefficient C(fc) 
exhibits a highly nontrivial behavior with a power-law 
decay as a function of k 15], indicating that low-degree 
nodes generally belong to well interconnected communi- 
ties (high clustering coefficient) while high-degree sites 
are linked to many nodes that may belong to different 
groups which are not directly connected (small clustering 
coefficient). This is generally the signature of a nontriv- 
ial architecture in which hubs (high degree vertices) play 
a distinct role in the network. Numerical simulations in- 
dicate that for large m, the clustering coefficient C{N) is 
almost independent of N (as we can see in Fig. 6), which 
agrees with the finding in several real networks 9] . Gen- 
erally, when the network size N is larger than 5000, the 
clustering coefficient is nearly stable. So most computer 
runs are assigned with 5000. Still, it is worth remarking 
that for the BA networks, C{N) is nearly zero, far from 
the practical nets that exhibit a variety of small-world 
properties. In the present model, however, clustering co- 
efficient C is fortunately found to be a function of m 
(see Fig. 7), also supported by empirical data in a broad 
range. Finally, the clustering coefficient C{k) depending 
on connectivity k for increasing m is also interesting and 
shown in Fig. 8. For clarity, we add the dashed line 
with slope —1 in the log- log scale. This simulation re- 
sults are supported by recent empirical measurements in 
many real networks. For the convenience of comparison 
with Fig. 8, we use two figures from Ref. [17] as our 
Fig. 9, from which one can see the agreements of theory 
and experiment are quite excellent. Empirical evidence 
generally supports the simulations of clusterin g-degree 
correlation. Though some previous models[i3, l2Ui UM 
can generate the power-law decay of the clustering-degree 
correlation, none of them as far as we know can produce 
the flat head as found in real graphs. This is a special 
property that our model successfully behaves. 

Another important source of information is the corre- 
lations of the degree of neighboring vertices. The average 
nearest neighbor degree is proposed to measure these cor- 
relations 



(14) 



Once averaged over classes of vertices with connectivity 
k, the average nearest neighbor degree can be expressed 
as 



A:„„(fc) = ^P(fc'|fc), 



(15) 



providing a probe on the degree correlation function. If 
degrees of neighboring vertices are uncorrelated, P{k'\k) 
is only a function of k' and thus knn{k) is a constant. 
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FIG. 6: The evolution of clustering coefficient (or C versus 
A^) which converges soon. 



FIG. 8: The clustering coefficient C(k) depending on con- 
nectivity k for increasing m. For comparison, the dashed line 
has slope -1 in the log- log scale. 




FIG. 7: Clustering coefficient C depending on the parameter 
m. 



When correlations are present, two main classes of pos- 
sible correlations have been identified: assortative be- 
havior if knn{k) increases with k, which indicates that 
large degree vertices are preferentially connected with 
other large degree vertices, and dis assortative if fc„„(fc) 
decreases with k. The above quantities provide clear sig- 
nals of a structural organization of networks in which 
different degree classes show different properties in the 
local connectivity structure. In the light of this mea- 
sure, we also perform computer simulations to test the 
knn{k) — k correlation, as shown in Fig. 10. As fc„„(fc) 
decreases with fc, one may find that our model can best 
illustrate disassortative networks in reality, that mainly 
is, technological networks (e.g. Internet, WAN) and bio- 
logical networks (e.g. Protein Folding Networks). As for 
the social networks, connections between people may be 
assortative by language or by race. Newman proposed 
some simpler measures to describe these types of mix- 




t 10 

k 



FIG. 9: The scaling of C(k) with k for two real networks 

(a) Actor network, two actors being connected if they acted in 
the same movie according to the www.IMDB.com database. 

(b) The semantic web, connecting two English words if they 
are listed as synonyms in the Merriam Webster dictionary. 
The dashed line in each figure has slope -1. 



4000 6000 

N 



FIG. 10: Average connectivity k„„{k) of the nearest neigh- FIG. 11: Degree-degree correlation r depending on A''. The 

bors of a node depending on its connectivity k for different evolution of r converges soon. 

m. 



ing, which we call assortativity coefficients Almost 
all the social networks studied show positive assortativ- 
ity coefficients while all others, including technological 
and biological networks, show negative coefficients. It is 
not clear if this is a universal property; the origin of this 
difference is not understood either. In our views, it repre- 
sents a feature that should be addressed in each network 
type individually. In the following, we use the formula 
proposed by Newman in Ref . |22|| , 

where ji, ki are the degrees of vertices at the ends of the 
ith edges, with i = 1, ...,M [M is the total number of 
edges in the observed graph). We calculate the degree 
assortativity coefficient (or degree-degree correlation) r 
of the graphs generated by our model. For large N (e.g. 
N > 5000), the degree-degree correlation r is almost in- 
dependent of the network size (see Fig. 11). Simulations 
of r depending on m are given in Fig. 12 and supported 
by^empirical measurements for disassortative networks 

V. CONCLUSION AND OUTLOOK 

In sum, integrating the mutual selection mechanism 
between nodes and the growth of strength preferential 
attachment, our network model provides a wide variety 
of scale-free behaviors, tunable clustering coefficient and 
nontrivial (degree-degree and strength-degree) correla- 
tions, just depending on the parameter m which gov- 
erns the total weight growth. All the results of net- 
work properties are found supported by various em- 
pirical data. Interestingly and specially, studying the 
degree-dependent average clustering coefficient C{k) and 
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FIG. 12: Degree-degree correlation r depending on m. 



the degree-dependent average nearest neighbors' degree 
knn{k) also provide us with a better description of the 
hierarchies and organizational architecture of weighted 
networks. As far as our knowledge, there at present ap- 
pears no model that could generate so many properties 
which are in broad agreement with empirical data. Thus, 
our model may be very beneficial for future understand- 
ing or characterizing real networks. Though our model 
can just produce disassortative networks (most suitable 
for technological and biological ones), which means one 
of its limitations, we always expect some model versions 
or variations that generate weighted networks with as- 
sortative property. Due to the apparent simplicity of our 
model and the variety of tunable results, we believe that 
some of its extensions will probably help address (e.g. 
social) networks. Therefore, our present model for all 
practical purposes will demonstrate its applications in 
future weighted network research. 
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